Parallel Branch Prediction on GPU Platform
Authors
Abstract
Branch prediction is a standard function in modern microprocessors. In a multicore or many-core processor, each core has its own copy of the branch predictor, which makes predictions for the program running on that core concurrently with the others. To evaluate parallel branch prediction in a many-core processor, existing schemes generally use a parallel simulator running on a CPU, which lacks a truly massively parallel execution environment to support many-core simulation and therefore simulates slowly. In this paper, we make a first attempt to use a real many-core platform, the GPU, to perform parallel branch prediction for future general-purpose many-core processors. We verify the new GPU-based parallel branch predictor against the traditional CPU-based branch predictor. Experimental results show that GPU-based parallel simulation is a promising way to speed up simulation for future many-core processor research.
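The per-core structure described in the abstract can be illustrated with a minimal sketch. The abstract does not say which predictor design the paper simulates, so this example assumes a simple bimodal predictor (a table of 2-bit saturating counters); the trace format, the table size, and the names `simulate_core` and `simulate_all_cores` are hypothetical. The cores are processed one after another here, and the comments mark the unit of work that a GPU implementation could assign to one thread.

```python
from typing import List, Tuple

# Hypothetical trace format: (branch address, branch was taken) pairs.
Trace = List[Tuple[int, bool]]

TABLE_BITS = 4  # assumed table size: 2**4 two-bit counters per core


def simulate_core(trace: Trace) -> int:
    """Run one core's private bimodal predictor over its own trace.

    Returns the number of mispredicted branches.
    """
    table = [1] * (1 << TABLE_BITS)  # every counter starts at 1 (weakly not-taken)
    mispredictions = 0
    for address, taken in trace:
        index = address & ((1 << TABLE_BITS) - 1)  # low address bits select a counter
        predicted_taken = table[index] >= 2
        if predicted_taken != taken:
            mispredictions += 1
        # Saturating update: move toward 3 on taken, toward 0 on not-taken.
        if taken:
            table[index] = min(3, table[index] + 1)
        else:
            table[index] = max(0, table[index] - 1)
    return mispredictions


def simulate_all_cores(traces: List[Trace]) -> List[int]:
    # Each call touches only its own table and trace, so the calls are
    # independent. On a GPU, each one could be assigned to its own thread;
    # in this sketch they simply run sequentially.
    return [simulate_core(trace) for trace in traces]
```

Because no state is shared between cores, the per-core results do not depend on the order in which the cores are simulated, which is the property that makes this workload a natural fit for a massively parallel platform.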
Similar resources
GPU Accelerated Parallel Branch Prediction for Multi/many-core Processor Simulation
Branch prediction is a common function in modern microprocessors. The branch predictor is duplicated in each core of a multi/many-core processor and makes predictions for multiple concurrently running programs. To evaluate parallel branch prediction in a multi/many-core processor, existing schemes generally use a parallel simulator running on a CPU that does not have a real massive ...
Parallel Implementation of Particle Swarm Optimization Variants Using Graphics Processing Unit Platform
There are different variants of the Particle Swarm Optimization (PSO) algorithm, such as Adaptive Particle Swarm Optimization (APSO) and Particle Swarm Optimization with an Aging Leader and Challengers (ALC-PSO). These algorithms improve the performance of PSO in terms of finding the best solution and accelerating convergence. However, these algorithms are computationally intensive. The go...
Isolated Persian/Arabic handwriting characters: Derivative projection profile features, implemented on GPUs
For many years, researchers have studied high-accuracy methods for recognizing handwriting and have achieved many significant improvements. However, an issue that has rarely been studied is the speed of these methods. Considering computer hardware limitations, it is necessary for these methods to run at high speed. One of the methods to increase the processing speed is to use the computer pa...
GPU-Based Parallel Researches on RRTM Module of GRAPES Numerical Prediction System
GRAPES (Global and Regional Assimilation and Prediction System) is China's new-generation numerical weather prediction (NWP) system. Because the system processes a large amount of data and has strict real-time requirements, it has long been an active research topic in parallel computing. This is the first time that we use GPU (Graphics Processing Unit) general-purpose computing and CUDA technology on RRTM (Rapid Radiat...
FIPIP: A novel fine-grained parallel partition based intra-frame prediction on heterogeneous many-core systems
Intra-frame prediction is an important and time-consuming component of the widely used H.264/AVC encoder. To speed up prediction, one promising direction is to introduce parallelism, and many approaches based on heterogeneous many-core systems have been proposed. But most of these approaches are limited by their use of highly irregular prediction formulas, which require a significant amount of branch instr...